Statistics for Cost-Based XML Query Optimization
نویسندگان
چکیده
Cost-based query optimization (CBO) is a very important area for data management in general and for XML data management in particular. For native XML database management systems (XDBMS), CBO techniques are harder than for relational databases, because the underlying tree-based data model is much more complex and the relative order (document order) between XML elements (nodes) matters. In this paper, we present our first ideas on statistics data structures supporting CBO, that is, the mapping of a logical access model with algebraic operators to the physical access model. In our prototype system called XTC (XML Transaction Coordinator), the physical access model is embodied by a toolbox of path processing operators from which the best performing operators have to be selected based on statistic information for the construction of the query execution plan.
منابع مشابه
Query Optimization for XML
XML is an emerging standard for data representation and exchange on the World-Wide Web. Due to the nature of information on the Web and the inherent exibility of XML, we expect that much of the data encoded in XML will be semistructured: the data may be irregular or incomplete, and its structure may change rapidly or unpredictably. This paper describes the query processor of Lore, a DBMS for XM...
متن کاملFramework-Based Development and Evaluation of Cost-Based Native XML Query Optimization Techniques
Reflecting on the history of database management systems reveals that cost-based query optimization has been the dominating method for effectively answering complex queries on large documents. Native XML database management systems provide an efficient infrastructure for storing, indexing, and querying large XML documents. Even though such systems can choose from a huge set of structural join o...
متن کاملStatistical Learning Techniques for Costing XML Queries
Developing cost models for query optimization is significantly harder for XML queries than for traditional relational queries. The reason is that XML query operators are much more complex than relational operators such as table scans and joins. In this paper, we propose a new approach, called Comet, to modeling the cost of XML operators; to our knowledge, Comet is the first method ever proposed...
متن کاملVAMANA : A High Performance, Scalable and Cost Driven XPath Engine
Many applications are migrating or beginning to make use native XML data. We anticipate that queries will emerge that emphasize the structural semantics of XML query languages like XPath and XQuery. This brings a need for an efficient query engine and database management system tailored for XML data similar to traditional relational engines. While mapping large XML documents into relational dat...
متن کاملCost-based optimization in DB2 XML
A. Balmin T. Eliaz J. Hornibrook L. Lim G. M. Lohman D. Simmen M. Wang C. Zhang DB2 XML is a hybrid database system that combines the relational capabilities of DB2 Universal Databasee (UDB) with comprehensive native XML support. DB2 XML augments DB2t UDB with a native XML store, XML indexes, and query processing capabilities for both XQuery and SQL/XML that are integrated with those of SQL. Th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006